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Abstract 

^O . We introduce and investigate a new model of a finite number of particles jumping forward on the real 

line. The jump lengths are independent of everything, but the jump rate of each particle depends on the 
relative position of the particle compared to the center of mass of the system. The rates are higher for those 
left behind, and lower for those ahead of the center of mass, providing an attractive interaction keeping the 
particles together. We prove that in the fluid limit, as the number of particles goes to infinity, the evolution 
of the system is described by a mean field equation that exhibits traveling wave solutions. A connection to 
extreme value statistics is also provided. 
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1 Introduction 



Interacting particle systems always show interesting behavior and represent a challenge for rigorous treatment. 
In this paper we introduce and investigate a new type of model which — to the best of our knowledge — has not 
emerged in the literature before. The system consists of a finite number of particles moving forward on the 
real line R. The center of mass of these particles is computed and, given a configuration, each particle jumps 
0^ . after an exponential waiting time that depends on its relative position compared to the center of mass. The 
more a particle lags behind the center of mass, the less its mean waiting time is; on the other hand, the more 
a particle is ahead of the center of mass, the more its mean waiting time is. When the exponential time has 
passed, the particle jumps forward a random distance, independent of everything else, drawn from a probability 
distribution concentrated on the non-negative half-line. While doing so, it naturally moves the center of mass 
forward by 1/n times its jump length as well, where n is the number of particles. Thus the dynamics has an 
effect of keeping the particles together, while randomness creates fluctuations in the system. 

Several real-life phenomena can be modeled this way: the evolution of prices set by agents trying to sell the 
same product in a market, the position of goats in a herd, etc. Mathematical finance is an area of active research, 
and many processes arising in finance are modeled with interacting particle systems, see e.g. the monograph on 
stochastic portfolio theory by Fernholz 14 , Banner, Fernholz and Karatzas [3], and the survey by Fernholz and 
Karatzas [T5]. Herding behavior has drawn serious interest in the physics literature, a few examples are Vicsek, 
Czirok, Ben- Jacob, Cohen and Shochet [3T]> Czirok, Barabasi and Vicsek |10| . Ballerini, Cabibbo, Candelier, 
Cavagna, Cisbani, Giardina, Lecomte, Orlandi, Parisi, Procaccini, Viale and Zdravkovic [3]. 

The problem of competing individuals is interesting even without dependence of the dynamics on the actual 
position of the particles. A series of papers have been written on models where the competing particles evolve 
according to i.i.d. increments: Ruzmaikina and Aizenman |26| . Arguin [T], Arguin and Aizenman [2J, Shkolnikov 
27 . The main point of investigation in these papers is characterizing point processes for which the joint 
distribution of the gaps between the particles is invariant under the evolution. 

Several authors have considered continuous models in which the motion (drift) of the particles depends on 
their rank in the spatial ordering; see e.g. Banner, Fernholz and Karatzas [4], Pal and Pitman [24], Chatterjee 
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and Pal [8], Shkolnikov [25J [2JJJ [3D] ■ These models are central objects of study in modeling capital distributions 
in equity markets. Branching diffusion-type models have also been investigated with a linear interaction be- 
tween particles, see e.g. Greven and Hollander [IB], Englander [IT] and the references therein. Linearity of the 
interaction is an important ingredient in these papers making the analysis somewhat easier. 

Closer to the present work are jump processes with interaction between the jumping particles. A related 
model is introduced in ben-Avraham, Majumdar and Redner [5], where particles compete with a leader who 
looks back only at the contestant directly behind him. Interaction between the leader and the followers is realized 
via a linear dependence of the mean jump length on the relative position of the leader and the followers. A 
process with rank-dependent jump rates is investigated in Greenberg, Malyshev and Popov |17] . and one with 
similar, but randomized jump rates in Manita and Shcherbakov [53]. Maybe the closest model to our system is 
the one featured in Grigorescu and Kang [19j . where particles jump in a configuration-dependent manner, and 
the authors give special focus to the situation where the dependence of particle advances on the configuration 
is via the center of mass of the particles. 

In our case, positive jump lengths are independently chosen, and the jump rate of a particle depends on 
its position relative to the center of mass. This jump rate function is non-increasing and non-negative, hence 
clearly cannot be linear in nontrivial cases. The setting is thus genuinely different from situations in the above 
examples. The non-increasing jump rate function creates a monotonicity in the system which keeps the particles 
together. This gives rise to a traveling wave-type stationary distribution. We address the following questions. 

• The two-particle case is, of course, tractable. We compute the stationary distribution of the gap between 
the two particles for several choices of the jump length distribution. However, we cannot handle the 
situation even for three particles. (The process is not reversible.) 

• If the number of particles goes to infinity, we expect that the motion of the center of mass converges to 
a deterministic one. This motion, together with the distribution of the particles, satisfy a deterministic 
integro-differential equation in this so-called fluid limit. We call the resulting equation the mean field 
equation. We write up this equation by heuristic arguments and, in some cases, we solve it for a traveling 
wave form, i.e. the stationary solution as seen from the center of mass. Even finding the traveling wave 
solution to this mean field equation is nontrivial. The reason for expecting a deterministic limiting equation 
is the averaging property of the dynamics. We in fact prove this phenomena in this paper later on for 
some cases. As the motion of the center of mass becomes deterministic, the particles decouple, each 
independently following the deterministically moving center of mass. Therefore in the limit it does not 
matter if we consider a deterministic cloud of particles (the limit coming from the empirical distribution of 
n — ¥ oo particles) , or a single particle in the corresponding distribution that is driven by the deterministic 
center of mass. The evolution of the fluid limit can therefore be called a mean field dynamics. Of 
course, the motion of the center of mass must be chosen appropriately to be consistent with the cloud 
or distribution of particles. In the traveling wave case, the distribution, as seen from the center of mass, 
becomes stationary, and the speed of the center of mass becomes constant in time. 

• Some traveling wave solutions lead us to Gumbel distributions, which naturally pose the question whether 
an extreme value theory interpretation can be given to the process. An idea by Attila Rakos shows that 
there is indeed such a connection. 

• Under restrictive assumptions on the jump rates, we prove that the evolution of the process in the fluid 
limit indeed satisfies the deterministic integro-differential equation we formulated by heuristic arguments. 
We use weak convergence methods and uniqueness results on the integro-differential equation. Since the 
center of mass is not a bounded function of the particle positions, standard weak convergence techniques 
need to be extended to our situation. 

This last point is the most interesting from a mathematical point of view. Our proof is a standard fluid limit 
argument, somewhat similar to those in Manita and Shcherbakov [23J and Grigorescu and Kang [Tj5]: we use 
an extended notion of weak convergence, tightness, and martingales to show convergence to the unique solution 
of the limiting integro-differential equation. As opposed to Greenberg, Malyshev and Popov [17] , asymptotic 
independence of the particles (as the number of particles tends to infinity) will be a consequence of the proof, 
rather than a method to rely on. This is because our case of dependence through the center of mass does not 
seem to provide an easy a priori way of showing asymptotic independence of the particles. 

We start by precisely describing the model in Section [2] We formulate our main results in Section [3] first, 
we formulate the limiting mean field equation together with some traveling wave solutions, and we also state the 
fluid limit theorem. These are the results we believe to be the most important, and additional statements can 
also be found in subsequent sections. We end this section with a collection of open questions. We proceed by 
analyzing the two-particle case in Section^] The mean field equation is investigated in Section[S] In particular, 
a connection with extreme value statistics is heuristically explained in Section 15.21 Section [5] contains our 



fluid limit argument under some restrictive conditions on the jump rates, and finally Section [7] supports, via 
simulation results, the idea that the fluid limit should hold in other cases as well. 

2 The model 

To model the phenomena described in the Introduction, we present the following model. The model consists 
of n particles on the real line. Let us denote the position of the particles by x\ (t) , x% (i) , . . . , X n (i), where t 
represents time and Xi(t) £ R for i = 1, . . . ,n. We now describe the dynamics. Given a configuration, the mean 
position (center of mass) of the particles is m n (t) — — Y27=i x i (*)• Now let us denote by w : R — > R + the jump 
rate function, a positive and monotone decreasing function. The dynamics is a continuous-time Markov jump 
process. The i th particle, positioned at X{ (t) at time t, jumps with rate w (Xi (t) — m n (t)). That is, conditioned 
on the configuration of the particles, jumps happen independently after an exponentially distributed time, with 
the parameter of the i th particle being w (xi (t) — m n (t)). 

When a particle jumps, the length of the jump is a random positive number Z from a specific distribution. 
This is independent of time and the position of the particle and all other particles as well. We scale the jump 
length so that EZ = 1, and for technical reasons we assume that Z has a finite third moment. If this distribution 
is absolutely continuous then its density is denoted by <p. 

Our focus is on the empirical measure of the n particle process at time t: 

1 - 
More generally, we are interested in the path of the empirical measure, fj, n (•) = (fi n (t)) t>0 , with initial condition 

M°) = £EILi<Sx4(o)- 

We introduce the following notation in order to abbreviate formulas, which is also well suited for weak 
convergence type arguments. For a function / : R — > R and a probability measure P on R let 

oo 

(/,P>:= [ f(x)P(dx). 



We also use the following notation: (/ (x) , P) = (/, P). Using this notation, we have m n (t) = (x, /j, n (i)). 

Now we can present the generator of the process acting on the integrated test functions as above. Let C b 
denote the space of bounded and continuous functions from R to R, and let Id denote the identity from R to 
R: Id (x) = x. Define 

(2.1) H:={feC b : |/| < 1} U {Id} . 

This set of functions will be important later on. For a function / £ H we have: 

L (/, im it)) = ((E (/ (x + Z))-f (x)) w(x~ m n (t)) , /x„ {€)) . 

The particular choice of H is so that we can apply the generator L to the center of mass m n (t). 

3 Results 

The case of a fixed number n of particles is difficult to treat. Of course the n = 2 case is doable: in Section [4] we 
look at two specific cases of the model, and in both we compute the stationary distribution of the gap between 
the two particles. At the end of the section we illustrate the difficulties that arise for more than two particles 
in Figure^] 

Our main interest is in the large n limit (so called fluid limit) of the particle system. Let us assume that 
the jump length distribution has a density if. Heuristically, in the fluid limit the empirical measure \i n (t) of 
the particles at time t becomes an absolutely continuous measure with density g(-,t) whose time evolution is 
given by the so-called mean field equation 

X 

(3.1) — —l— = -<w{x-m{t))e{x,t)+ j w(y-m(t))g(y,t)(p(x-y)dy, 

— OO 

where 

oo 

(3.2) m (t) = I xq (#, t) dx 



is the mean of the probability density g(-,t). The first term on the right hand side of (|3.1j) accounts for the 
particles hopping forward from position x at time t: at time t there is a density of g (x, t) particles at x, who 
jump forward with rate w (x — m (£)). The second, integral term on the right hand side of (|3.ip accounts for 
the particles who are at position y < x jumping forward at time t to position x: at time t there is a density 
of g (y, t) particles at y, who jump forward with rate w (y — m(t)), and need to jump forward exactly x — y to 
land at x, the probability of this being proportional to (p (x — y). 

Our first results concern solutions to the mean field equation (|3.ip in the form of a traveling wave 

(3.3) g(x,t) = p(x — ct), 

where c is the constant velocity of the wave. Traveling waves of this form represent stationary distributions as 
seen from the center of mass, which travels at a constant speed c. Substituting (|3.3p into the mean field equation 
(|3.1|l . and then writing x instead of x — ct for simplicity everywhere, we arrive at the following equation: 



(3.4) - cp' (x) = — w (x) p (x) + / w(y)p(y)tp(x-y)dy. 

— CO 

Recall that the jump rate function w is a non-increasing function; any non-constant w gives a strong enough 
attraction to keep the particles together, i.e. there is a probability density traveling wave solution p. This is 
formulated in the following theorem for exponentially distributed jump lengths. 

Theorem 3.1. In the specific case when the jump length distribution is exponential with mean one, the solution 
to (|3.4[) for general jump rate function w is 

(3.5) p(i)=ife« U ' , 

where K is a constant. If w is non-constant then for an appropriate normalizing constant K , p of (|3.5|) is a 
probability density with at least exponentially decreasing tails. In our model the speed of the wave is determined 
by the fact that p is centered, thus c is the unique solution of 

CO x 

(3.6) / xe° dx = 0. 



There are several specific jump rate functions w for which the resulting stationary distribution is of interest. 
In particular, the case of w (x) = e~@ x , where j3 > 0, leads to the generalized Gumbel distribution: 






Corollary 3.2. If w (x) = e p x , where (3 > 0, then from Theorem \3.1\ we obtain for the stationary density 

(3.7) p p (x) 

where ip is the digamma function: 



(*) 

*w- rw 



r(i) 

Specifically for /3 = I the stationary distribution is the centered standard Gumbel distribution. The velocity of 
the wave is 

c=i e -*(*). 

The Gumbel distribution often arises in extreme value theory (Gumbel [3D]), e.g. this is the limiting distribu- 
tion of the rescaled maximum of a large number of independent identically distributed random variables drawn 
from a distribution that has no upper bound and decays faster than any power law at infinity (e.g. exponential 
or Gaussian distributions). More generally, the feth largest value, once rescaled, follows the generalized Gumbel 
distribution (Bertin [5], Clusel and Bertin [5], Gumbel [30] ). Therefore the question arises whether there is an 
underlying extremal process in the dynamics. It turns out that there is a connection between the mean field 
model and extreme value theory: this is discussed in Section I5~21 

The validity of the mean field equation is strongly supported by simulation results for various special cases 
of the model, see Section [7] We expect this to be true in quite some generality (for fairly general jump rate 
functions and jump length distributions). 



The main result of the paper is to rigorously validate this approximation. That is, our goal is to show that in 
the fluid limit — when the number of particles tends to infinity — the mean field equation (|3.ip holds. However, 
when trying to rigorously prove this, technical difficulties may arise. Therefore we impose some restrictions on 
the jump rate function and the jump length distribution. Our main restriction is that we suppose that w is 
bounded: w (x) < a. We also impose some additional (smoothness) restrictions on w, but boundedness is the 
main restriction. Our restrictions on the jump length distribution are light: we assume that it has a finite third 
moment. 

In order to precisely state our main theorem, some preparations are needed. First notice that the mean 
field equation (|3.1j) can be rewritten as the evolution of a probability measure \x (■) using test functions from H. 
Moreover, we do not have to assume that the jump length distribution has a density either. Define 



At J (/* (')) : = (/, M (*)> - </, fi (0)) - J ((£ (/ (x + Z)) - J (x)) w(x-m («)) , n (s)) 
(3.8) 

= </,mW)-(/,m(0)>- I L{f,n{s))ds 



where 

m(t) = (x,fi(t)) . 

We say that /u (•) satisfies the mean field equation if 

(3.9) A tJ (»(■)) = 

for every t > and / <G H . 

The position of the center of mass of the particles plays an important role in our model, and therefore we 
always assume it exists (and is finite). Consequently, it is natural to look at \i n (t) as an element of V\ (R), the 
space of probability measures on R having a finite first (absolute) moment. In order to talk about convergence 
in V\ (R) we need to define a metric on V\ (R). We use the so-called 1-Wasserstein metric. For fx, v £ V\ (R) 
define the set of coupling measures 

II (/i, v) := {tt £ Mi (RxR) : w (• X R) = /z, 7r (R X •) = u} , 

where Mi (R x R) denotes the space of probability measures on R x R. The 1-Wasserstein metric on the space 
V\ (R) is defined as 



(3.10) d\ (fi, v) := inf < / \x — y\ w (dx, dy) : 7r £ H (/x, v) 




for fi,i/ePi (R). Much is known about the metric space {Vi (R) , d\)] we will state what we need later. 

The natural space in which to consider /i„ (•) is the Skorohod space D ([0, oo), V\ (R)): we know that /x ra (•) is 
a piece- wise constant function in V\ (R), having jumps whenever a particle jumps. Similarly, for any test function 
/ (for instance / £ H), the natural space in which to consider (f,/J, n (■)) i s the Skorohod space D ([0,oo),R). 

In order to prove weak convergence of {fx n (•)} t »>i m * ne appropriate Skorohod space, we have to make the 
following assumptions on the initial distribution of the particles. 

Assumption 1. Our first assumption is that 

(3.11) lim limsupP -Vb(0)|>I =0. 

Assumption 2. Our second assumption is that there exists a constant c such that for every e > 



- 5^ 1 [|xi (0)1 >cs- 2/3 



> e/4 < e/3. 



(Note: the exponent 2/3 has no special significance. This is simply because we assume E (-Z 3 ) < oo for simplicity. 
We could assume E (Z 2+e ) < oo for some small e > 0, in which case we could replace the exponent 2/3 with an 
exponent of 2/ (2 + e) in Assumption 2.) 

A sufficient and simple condition for both Assumption 1 and 2 to hold is that 



su P e(-!-^mo)| 3 ) < 



Assumption 3. Our third assumption is that 

(3.12) ^ n (0)^„^ 

in V\ (R), where v £ V\ (R) is a deterministic initial profile and => n denotes weak convergence. 
We are now ready to state our main theorem. 

Theorem 3.3. Suppose that the non-increasing jump rate function w is bounded, and that it is differentiable 
with bounded derivative. Suppose the jump length Z is scaled such that ¥,Z = 1 and that it has a finite third 
moment. Suppose that Assumptions 1, 2 and 3 from above hold. Then 

(3.13) M-)=>-n|*(0 

in D ([0, oo), V\ (R)), where fi (•) is the unique deterministic solution to the mean field equation (|3.9p with initial 
condition v. 

The proof of this theorem is in Section [5] The argument follows the standard method for proving weak 
convergence results. Namely, based on Prohorov's theorem, we need to do three things: 

• Prove tightness of {/i n (-)} n >i m the Skorohod space D ([0, oo), V\ (R)). 

• Identify the limit of {fi n (-)}„>i : here we essentially have to show that the limit solves the deterministic 
mean field equation. 

• Prove uniqueness of the solution to the mean field equation with a given initial condition. 

3.1 Open questions 

We end this section with some open questions. 

• Theorem 13.31 is a law of large numbers type result, so it is natural to ask about the fluctuations around 
this first order limit. In particular, how does the variance of the empirical mean m n (t) scale with n and 
i? Preliminary simulations suggest that this variance grows linearly in t, and for fixed t decays in n as 
n~ a for some a > 0. (The simulations are inconclusive of what the exponent a should be.) 

• Once the order of fluctuations of the empirical mean m n (t) is known, can we determine the limiting 
distribution of m n (t) with appropriate scaling? 

• Going back to the finite particle system: can we really not deal with a fixed number (n > 3) of particles 
and determine the stationary distribution viewed from the center of mass in this case? 



• In Theorem 13.31 we make assumptions on the jump rate function w, the jump length distribution, and the 
initial distribution of the particles. Most of the assumptions are fairly weak, except for the assumption 
that w is bounded, which is the main restrictive assumption. We believe that these assumptions can be 
weakened, and in particular we are interested in removing the assumption that w is bounded. Can one 
prove the same result for unbounded w, e.g. w (x) — e~ x 1 

4 Two-particle system 

As an introduction to the model, we look at the simplest case: when n = 2, i.e. when there are two competing 
particles. In particular, we look at two specific cases of the model, and in each case we compute the stationary 
distribution of the gap between the two particles. At the end of the section we illustrate the difficulties that 
arise in the case of n > 3 particles. 

4.1 Deterministic jumps 

First, let us look at the model when the length of the jumps is not random but a deterministic positive number. 
We always have the freedom to scale the system so that this length is equal to unity. So, in this subsection, we 
look at the model with deterministic jumps of length 1. 

In the case of two particles, x\ (t) and X2 (£), we are interested in how far away they move from each other, 
thus in the gap g (t) = \xi (t) — X2 {t)\. For simplicity let us suppose that Xi (0) — X2 (0) € Z, so that g (t) G N 
for all t > 0. In this case g (t) is a continuous-time Markov process on N, with infinitesimal generator Q given 
by the model in the following way. Since the gap can only change by 1, Q is a tridiagonal matrix. If the gap is 



0, i.e. x\ (t) — X2 (t), then both particles jump with rate w (0) and whichever one does, the gap changes to 1. 
Therefore the rate at which the gap changes from to 1 is 2w (0) and thus 

Qo,o = -2w(0), Q 0)1 = 2w (0) . 

If the gap between the particles is k > 0, then the one in the lead jumps with rate w (-|), while the one which 
is behind jumps with rate w (— §)• This means that the gap increases by 1 with rate w (|) and decreases by 1 
with rate w (— f ) and so 

Qk,k-i=w (--) , Q k ,k = - (w f -- J +w f - J J , Qk,k+i=w(- 

When the jump rate function w is a monotone decreasing, non-constant function, then g (t) is a positive 
recurrent birth-and-death process with stationary distribution n — (7Tq , ~k\ , ~ki , . . . ) given by 



tto 



OO T-rfc-1 f i \ 

1 + 2 £n^-i) 

rfc-l 






TTfc = 2tt ±L *=° - y ' A/ , for k > 1. 



Example 4.1 Exponential jump rate function 

If «; (a;) = e - ^, where /3 > 0, then the stationary distribution becomes 



1 

^o = 3s -7 

fc=i 

^ = ss ^7> forfc>l, 

1 + 2 Ee"^ 
fc=i 

which shows Gaussian decay. 

Example 4.2 Jump rate function is a step function 
Let 

la if x < 0, 

w [6 if0<x, 
where a > 6 > 0. Then a short calculation shows that the stationary distribution is 

a — b 



TTO 



a + b 

k 



„a— b f b , 

i: k = 2—- - , for k > 1, 
a + b \ a 



which shows exponential decay. 



4.2 Jump length distribution having a density 

Consider again the case of two particles, but now let us suppose that the distribution of the length of the 
jumps has a density, and denote this by if as mentioned in Section [5J As before, we are interested in the gap 
g (t) = \xi (t) — xi (t)\ between the two particles. Denote by p (<?, t) the probability density of the gap g at time 
i, which is concentrated on [0, ex)). The time evolution of the gap is completely described by the time evolution 
of p, which is described by the master equation 

a 
P (g) = -P(g) O (-3/2) + w (3/2)) + I p{y)w (y/2) <p(g- y) dy 



00 00 

+ P(y)w {-y/2) ip{y~g)dy+ p (y) w (-y/2) (p(y + g) dy, 
9 



(a) 
(b) 



H^ 




Figure 1: Events that change the gap length. 



where the overdot denotes time derivative. The first term on the right hand side accounts for the case when 
the gap length is g and one of the two particles jumps (Fig. QJi and [I}}). The second term accounts for the 
case when a gap of length g is created from a previous gap of length y < g due to the leader jumping forward 
(Fig. [TJ;). The third term accounts for the case when a gap of length g is created from a previous gap of length 
y > g due to the laggard jumping forward, but still remaining the laggard (Fig. [Hi). The fourth term accounts 
for the case when a gap of length g is created from a previous gap of length y > due to the laggard jumping 
forward and overtaking the leader (Fig. QJ). It is easy to check that this conserves the total probability, i.e. 
J™ P (9)dg = 0. 

4.2.1 Stationary distribution of the gap 

Our goal is to compute the stationary density, i.e. p such that p (g) = 0. It is unlikely to be able to find a closed 
formula for the stationary density in full generality (for general tp and w), and so we restrict ourselves to the 
special case when <p(x) = e~ x l[x > 0] and w (x) — e~@ x , where (3 > 0. In this case the equation for stationarity 
becomes 



~P (ff) (' 



e 2 +e 2 



p(y)e~ 



-^^{a-v) dy+ p (y)e^ L e- {y - 9) dy+ p{y)e^e' iy+9) 

g 



dy. 



If we look at the last integral as the sum of the integral from to g and the integral from g to 00, then the 
equation above can rearranged to get 

g 00 

= -p (g) (e^ + e~^) + fp (y) e~ 9 [e(^ +1 > + e(^ 1 ) y ] dy + f p(y) e^ 1 ) 9 [e 9 + e' 9 ] dy, 



or more simply, after dividing by 2, 

'/V 



(4.1) 



= — p (g) cosh 



00 

p{y) cosh i I - - lj y\ dy + cosh(g) / p(y)e^~ 1 > v dy. 



4.2.2 Special case of = 2 

In the special case of j3 = 2, (|4.1[) simplifies to 



(4.2) 



P (9) cosh (g) = e 9 p (y) dy + cosh (g) / p (y) dy 



This equation can be solved explicitly. Using the fact that L p (g) dg = 1, and thus J p (y) dy — 1— J"' p (y) dy, 

we arrive at 

a 

p (g) cosh (g) = - sinh (g) / p (y) dy + cosh (g) , 



and thus dividing by sinh (g) we have (for g > 0) 



(4.3) 



coth (g) p(g) = - p (y) dy + coth (g) . 



Let us differentiate both sides of f|4.3[) with respect to g. After rearranging, using the identity 1/ sinh 2 (g) — 1 
coth (g) — 2, and dividing by coth(g), we arrive at 

V (.<?) = ( coth (g) - —77-73 )p(9) 



coth (g) / sinh (g) cosh (g) 

The general solution to this linear equation is 

sinh( 3 ) 1 

-P (3) = c ' 2 • 

cosh (g) cosh (g) 

The restriction L p (g) dg — 1 specifies the constant: 

-1 nco 



/ p (3) dff = 



thus c = and consequently the density p is 



'cosh (5) 



tanh (5) 



= 1 + C, 





(4-4) p(g)= X 2 

cosh (5) 

4.2.3 The case of /? 7^ 2 

In the case of j3 7^ 2, one way of attacking (j4.ll) is differentiating both sides with respect to g twice, and then 
using (I4.1J) again to get rid of the integrals. This way we arrive at a second-order homogeneous differential 
equation with non-constant coefficients: 

cosh (?f\ p" (g) + 08+1) sinh (?f) p> (g) + (t + i \ cos h (?f\ p (g) = 0. 



Dividing by cosh ( 4f ) : 



(4-5) p" (g) + 09 + 1) tanh (^) p' ( 5 ) + (^ + p ( 5 ) = 0. 

Before looking at the solutions to this, it is worth looking at large g asymptotics. If g is large, then tanh I 4p ) 
1, and therefore the following equation approximately holds: 



The two solutions to 



a--(.; + i.a+('^ + ^)=.i 



are Ai = — j3/2 and A2 = — (1 + (3/2), and therefore the density p(g) is asymptotically a constant multiple of 
either e~^ 9 ' 2 or e~ ( - 1+,3 ' 2 - )9 . Since for /3 = 2 we know by (|4.4[) that p (5) is asymptotically e~ 2s , we suspect 
that for general j3 the density p(g) is asymptotically a constant multiple of e~( 1+/3 / 2 ) 9 . This suspicion will be 
confirmed shortly. By differentiating, it can be checked that a solution to (J4.5I) is 

(4.6) p(g) = 



(cosh(4f 



Using the method of variation of constants, the other solution can be found as well, and therefore the general 
solution to (14.51) is 



1 /( C ° Sh (^)) dy 
( 4 - 7 ) P(9) = ci- —7TTVP +C2 ~r Ta xxi+2//3 ' 



cosh ( % ) ] ( cosh 



(*) 



where c\ and C2 are constants. In our case, one constraint on the constants is the fact that J Q p{g)dg = 1. 
The other constraint is given by the following. We know that p (g) is a continuous density function on (0,oo), 
and due to (|4.ip we know that 

oo 

(4.8) \im p(g)= fp(y)e^- 1 ) y dy. 



So from (|4.7|) and (I4.8P the second constraint on the constants is 

If /«„.\\ 2 //3 



C\ = Ci 



^^7»1^ 



, , , ,„ , , + 2 ^ " I / . /«.,\\ 1 + 2 /' 3 





cosh(f)p /P i (cosh(^) 



However, using the substitution u = e 2 y one can see that 



/ (cosh(^)V^ "" ' 



showing that C2 = 0. So for general /3 the stationary density of the gap is 

1 



V (9) = c- 



1+2//3 ■ 



(cosh(f)) 
with c being the normalizing constant. 

4.3 Many-particle case 

In the n > 3 case with deterministic jumps, the particles Xi (t) , . . . , x n (t) form a continuous-time Markov 
process on an n dimensional lattice. Viewed from the mean position of the particles, this becomes a Markov 
process on an n — 1 dimensional lattice. However, even in the n = 3 case the combinatorics of the problem 
becomes too difficult to find a closed form for the stationary distribution (see Figure [5]) . The model with the 
jump length distribution having a density presents similar difficulties in the n > 3 case. (As we have seen, the 
problem is not completely solved even in the n — 2 case.) Understanding the finite particle case is the subject 
of future work. 

5 Mean field 

We now turn our attention to the mean field approximation of the model and study the solutions of the mean 
field equation (|3.1[) . First of all, using the fact that ip is a probability density function, we can check that the 
mean field equation (|3.ip conserves the total probability, i.e. f^° g t ' ' dx = 0. 

An important quantity is the average speed of the mean of the density. This speed can be expressed as a 
simple integral by differentiating equation (|3.2p with respect to time and using the mean field equation (|3.ip : 



3C 



771 (t) = J X — - — dx 



xw (x — m(t)) g(x,t)dx + / / xw (y — m(t)) g(y,t)ip(x — y)dydx 

—00 —00 

OO CO 

xw (x — m (t)) g (x,i) dx + / w (y — m (£)) g (y, t) I xip (x — y) dxdy 

-00 y 

00 00 

/ xw (x - m (t)) Q (x, t) dx + / w{y~m{t))g{y,t)dy J r I yw (y - m(t)) g(y,t) dy 

-00 —00 —00 

o 

(5.1) = / w{y-m(t))g{y,t)dy. 

10 




Figure 2: In the n — 3 case with deterministic jumps the particles viewed from the mean position form a 
continuous-time Markov process on a 2 dimensional lattice with directed edges, as seen above. The coordinates 
in the nodes show (3 (xi (t) — m„ (t)) , 3 (x2 (t) — m„ (t)) , 3 (23 (t) — m n (£))). (We multiply by 3 only so that 
we need not write fractions in the figure.) The rate of going from one node to another on a particular edge is 
indicated on the edge itself, however, once again the argument of the jump rate function w (■) is multiplied by 
3 so that we do not need to write fractions in the figure. The edges going outward from the origin are colored 
in blue, the ones which are going inward toward the origin are colored in red, while the remaining edges are 
colored green. Since the rates on the incoming red arrows are larger than those on the outgoing blue arrows, one 
can see that there is a drift towards the origin, and therefore one expects there to be a stationary distribution 
on this lattice. However, there can be no detailed balance since there are directed cycles in this graph, and 
therefore calculating the stationary distribution becomes difficult. 
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Our goal is to find a stationary solution to the mean field equation Q3.1[) in the form of a traveling wave, see 
In this case p is centered, since 



xp (x) dx — / (x — ct) p (x — ct) dx = / xp (x — ct) dx — ct = I xg (x, t) dx — ct = m (t) — ct = 



We now turn to proving Theorem 13. 1 1 

Proof of Theorem \3.1l In the specific case when the jump length distribution is exponentially distributed we 
solve (|3.4j) for general jump rate function w. Substituting tp(x) = e~ x l[x > 0] into p.4[) we get 

X 

(5.2) - cp' (x) = -w (x) p (x) + w(y)p(y)e~ {x ~ v) dy. 



First, we take the derivative of the integral term in (J5.2D . Using the property of the exponential function, that 
its derivative is itself, we get 

w(y)p(y)e~^~ v) dy\ = w{x)p(x)- / w (y) p(y) e~ {x - y) dy 




= w (x) p (x) — w (x) p (x) + cp' (x) — cp' (x) , 
where we used (|5.2p in the second line. It follows that for some constant D we have 



w (y)p(y) e v 'dy = cp(x) + D. 

-oo 

Plugging this into (I5.2p we arrive at 

(5.3) — cp' (x) + w (x) p (x) = cp (x) + D. 

To show that D = 0, let us integrate both sides of the equation with respect to x from — oo to oo (by integrating 
from — R to R and taking the limit as R — > oo). The integral of the first term on the left hand side is since 
the total probability is conserved by the mean field equation, and the integral of the second term on the left 
hand side is c according to f|5. If) . Since p is a density we also have that the integral of the first term on the 
right hand side is c as well, therefore 

R 



1 

and thus D = 0. Now rearranging (|5.3[) we get 



lim / Ddx = 0, 

R— >oo 

-R 



p' (x) = [-w(x)-l)p (x) 



from which we arrive at (I3.5[) . 

If w is non-constant, then from (15.11) it follows that 



w (— oo) > c > w (oo) . 

From this it immediately follows that p decays at least exponentially as x — > ±oo. 

The last assertion of the theorem follows from the fact that p is centered, which we have seen above. □ 

Alternative proof of Theorem \3.1\ by LP. Toth. Let us consider a moving walkway that goes to the left with 
speed c, above which there is also a rope that goes to the right with speed v >> c (see Figure [3]). We define 
the following stochastic process, which in the v — > oo limit is the same process that is described by 



• All the particles are on the moving walkway going left with speed c and a particle at x jumps up and 
catches the rope with rate w (x). 

• If a particle is hanging to the rope, then it lets go and falls back with rate v. 
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Figure 3: The process described above. Particles move to the left with speed c and a particle at x jumps up 
and catches the rope with rate w (x). The rope is moving to the right with speed v » c. Particles on the rope 
let go with rate v. 



It is clear that since the speed of the rope is v, the jump length of the particle to the right is exponentially 
distributed with parameter v • ~ = 1, which is the same length as in our model. Furthermore, in the »-)oo 
limit the jump is instantaneous, which is exactly what we are studying. Let us denote by p d (x) and p v u (x) 
the density of the particles down on the walkway and up on the rope, respectively. The sum of the two is a 
probability density therefore 

oo 

r {p v d {x)+pl{x))dx = l. 
Our plan of action is the following: we calculate the stationary densities of this process for fixed finite v, and 



then look at the result as v 
p (x) is the solution to (jS~ 

(5.4a) 
(5.4b) 



-> oo. What we expect is that lim 1 ,_ i . 00 p v u (x) = and lim l ,_ ) . 00 p v d (x) = p (x), where 
The master equations governing the time evolution of the densities p v d and p" u are 

dp d {x) d Pd {x) 



dt 
dp v u (x) 



dx 



w (x) p d (x) + vp v u (x) 



= { _ v) d^M +w{x)pl{x) _ vplix) 



dt dx 

The first term on the right hand sides corresponds to the particles that move horizontally (along the walkway or 
along the rope), while the second term corresponds to the particles that jump up and the third term corresponds 
to the particles that jump down. Setting the left hand sides to in (I5.4[) we arrive at the equations for the 
stationary densities: 

.MM 



(5.5a) 

(5.5b) 

Summing (|5.5a|) and (|5.5bp we get: 

and thus 



= c- 



dx 



w (x) p d (x) + vp v u (x) 



= i _ v) dp^M +wix)pVAx) _ vplixy 







dx 

d_ 

dx 



{cpl (x) - vpl (x)) , 



K = c Pd (x) - vpl (x) 
for some K constant. However, taking the limit x — > oo we see that K 



and thus 



(5.6) 



p"n 0*0 = -Pd ( x ) ■ 



Putting (|5.6p back into (I5.5al) and rearranging we arrive at 



dp d (x) 
dx 



from which 



\w{x)-\ .)p'„U i. 



/Ow-i)* 

Ke° 



Pd( x ) 
follows for some K constant. Note how p d is independent of v, and also that lim^^oo p" u i 



as expected. □ 
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5.1 Specific examples 

In this subsection we look at the stationary distribution p of (|3.5p in the case of specific jump rate functions. In 
all four cases we also determine the speed of the traveling wave from Theorem 13.11 Details of calculations are 
given only in the case of Example 15.11 the calculations in the other examples are quite simple from Theorem 

EH 



Example 5.1 Exponential jump rate function 

If w (x) — e~P x , where (3 > 0, then from Theorem 13. II we obtain 

where K is an appropriate normalizing constant. Integrating this over R gives us the value for K and thus the 
density becomes 

p,(x) = ^ T e-K^)-^ +l2 ^ i) . 
To check this let us calculate the integral: 

nri OO 

pp (x) dx = / — -. — r-e v 3 )~ e dx 

J r(4) 

— oo — oo \P J 

oo oo 

using the substitutions y = x + ° E ^"> and z — e~^ y . From Theorem 13. II the speed of the wave c is determined 
by the fact that pp is centered, so to compute c we must compute the mean of the density. Using similar 
substitutions as before we get: 

OO OO oo 

xpp(x)dx = + / (x-\ \pp{x)dx = 1 -r-- / ye~ y ~ l * ' dy 

— oo \@ J ~~ °° 

oo 

log(c/3) ( P f \og{z) A ^_ z ( dz\ _ log(c/3) 1 [,„„,^A-i -z 



> r(i)V P { PzJ P ^r(i) 







iH«> + ?g 



which is zero when 



where ip is the digamma function: 



P 



, M r' (a) 



r(x) 

Therefore the stationary distribution in this case is given by (13.7[) . Note the exponential tail of pp on one side, 
and the double-exponential tail on the other side. Specifically, in the case of P = 1 the stationary distribution 
is the centered standard Gumbcl distribution: 

Pl {x)=e-^- e - { ^\ 

where 7 is Euler's constant. Moreover, it can be seen that pp is a rescaled generalized Gumbel distribution with 
parameter 1//3 (Bertin [6], Clusel and Bertin [9 , Gumbel [20]). As mentioned in Section [3l the (generalized) 
Gumbel distribution often arises in extreme value theory, so it is natural to ask whether there is an underlying 
extremal process in the dynamics. It turns out that there is a connection between the mean field model and 
extreme value theory when k = 1//3 is an integer: this is discussed in Section [5.21 

Example 5.2 Jump rate junction is a step function 
Let 

la if x < 0, 

W \b if 0<x, 
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where a > b > 0. Then from Theorem 13. II the solution to (|5.2p is 

, , \Ke^- x ) x if as<0, 
[Xev<= V* if < ar, 

where the normalizing constant is 

(a-c)(c-b) 

c(a-b) ' 
The speed of the wave follows from Theorem 13.11 

a + b 



and thus the stationary distribution is 

(5-7) p b ( x ) = ^Z* e -S$\*\ } 



2 (a + b) 

which is a Laplace distribution. 

Example 5.3 Jump rate function is ... 
Let 

a if x < 1, 

(5.8) w (x) = { -&=bx + <*£ if — 1 < a: < 1, 

if 1 < x, 

where a > b > 0, thus w is a continuous function which is constant on the intervals (— oo, —1) and (l,oo) and 
linear on [—1,1]. Then from Theorem 13. II the solution to (|5.2[) is 

fife^+t^ 1 ) 1 ifx<-l, 

Pa.b (x) = | ife-^^+C^- 1 )* if — l<a;< 1, 

[ife^+f^ 1 ) 1 ifKx, 

where K is the normalizing constant. Note that for c = ^^ the density p a ^ is symmetric, therefore according 
to Theorem 13.11 this is the speed of the wave. Therefore the stationary distribution is 

( Ke 2^TE)-7^E\ x \ if jajl > 1, 

Pa,b (X) = < a-b 2 

[ife-jRiy 1 if|x|<l. 

This distribution has exponential tails, but around it behaves as a normal distribution. 

Example 5.4 Jump rate function is arccotangent 
Let 

w (x) = arccot x, 

where arccot x is defined as the inverse of cotx on the interval (0,7r), therefore w is a monotone decreasing 
smooth function. From Theorem 13. II the solution to (15.21) is 



p(x) = Ke x d ^ arccot:r - 1 )+2T lo g( 1 + :l;2 ) 

where K is the normalizing constant. Note that for c = ? the density p is symmetric, therefore according to 
Theorem 13.11 this is the speed of the wave. Consequently the stationary distribution is 

p(x) — Ke x d arccota: -i)+-j io g (i+i- 2 )^ 

5.2 Connection with extreme value theory 

In the previous subsection we have seen that when <p(x) — e~ x l{x > 0] and w (x) = e~P' x then the (rescaled 
generalized) Gumbel distribution arises as the stationary distribution of the mean field system. This is inter- 
esting, because at first sight there is no extremeness in the model. In this section we discuss the connection 
between the model and extreme value theory. 
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5.2.1 Standard Gumbel case 

Let us start with the standard Gumbel case, i.e. (3 — 1. Recall that according to our observation in the 
introduction, we can assume that the center of mass moves with a constant velocity c, and we consider a single 
mean field particle X(t) that jumps with rate e ct ~ x ^ t \ so 

P (X (t) changes between t and t + dt) = e ct ~ x[t) dt + o (dt) . 

When this jump occurs, X(t) increases by a random number drawn from an exponential distribution with 
parameter 1 (denoted by Exp(l) in the sequel). 

How can we view this as an extreme process? The answer to this question was given by Attila Rakos, 
whose idea is the following. As time goes forward, let us take gradually more and more new independent and 
identically distributed Exp(l) random variables, and let us look at the distribution of the maximum of these 
random variables, which we denote by Y (t). Specifically, at time t let there be N (i) = e ct /c random variables 
altogether, so Y (t) is the maximum of N (t) i.i.d. random variables. (In this subsection we neglect mathematical 
precision (such as the fact that we should have defined N (t) to be integer- valued: N (t) = [e ct /c]) for the sake 
of clarity of the argument.) There are N' (t) dt = e ct dt new random variables trying to "break the record" 
between t and t + dt, and thus 

/ \\ N '( t ) dt 

P (Y (t) changes between t and t + dt) = 1 - ( 1 - e' Y{t > 



= N' (t) dfe-^W + (n> (i) dfe-HO) 



and if Y (t) changes, then it jumps according to Exp(l) . Here we use the memoryless property of the exponential 
distribution, i.e. the fact that if T is exponentially distributed then 

P(T > s + t\T > s)=P(T >t). 

Thus we can conclude that the two processes are basically the same. Furthermore, we know that Y (t) — 
log N (t) follows the standard Gumbel distribution for large t (see, for example, Gumbel [20] or the excellent 
survey by Clusel and Bertin [5]): 

lim P (Y (t) - log N (t) <x) = e~ e ~* , 

t— >oo 

and therefore since log N (t) = ct — log c, it follows that 

lim P (Y (t) -ct<x)= e -e- {m+losc) . 

t— >-oo 



Thus by the connection we have 



lim P (X (t) -ct<x) 



t— foo 

and since the distribution of X (t) — ct is centered, it follows that 

c = e 1 , 
where 7 = 0.5772 ... is Euler's constant. 

5.2.2 Generalized Gumbel case 

The idea above extends to the general case with a slight modification. In the following let k = 1//3 be a fixed 
positive integer. We use k and 1//3 interchangeably. 

In general the mean field particle X (t) jumps with rate e^^ ct ~ x ^' , so 

P (X (t) changes between t and t + dt) = e 0(ct ~ xit)) dt + o {dt) , 

and if it changes then it jumps according to Exp(l). 

How can we view this as an extreme process? The idea is similar to the one in the previous section. Once 
again, as time goes forward, let us take gradually more and more new independent and identically distributed 
Exp(l) random variables. Specifically, at time t let there be N (t) = e^ ct /f3c random variables altogether. 
Denote by Yk (t) the k th largest value among these random variables at time t. We focus on Y (t) := kYk (t). 
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There are N' (t) dt = e@ ct dt new random variables trying to "break the record" between t and t + dt, and since 
Y (t) changes exactly at the same time Yfc (£) does, we have that 

/ v i \\ N'(t)dt 

P (Y (t) changes between t and t + dt) = 1 - ( 1 - e~ Yk{t > ' 



= N' (t) dte- Yh(t) + o (n 1 (t) dte' Yk{t A 

= e f><*-Y h (t) dt + f e fict-Y,,(t) d ^ 



and if Y (i) changes, then it jumps according to Exp(l). This follows directly from the fact that when Yfc (t) 
changes, then it jumps according to Exp(fc), which is due to the following: Yfc (t) changes when a new random 
variable jumps past the previous k th largest r.v., making the previous k th largest r.v. the new (k + l) s largest 
r.v., and thus the length of the jump that Yk (t) makes is the difference between the new k th and (k + l) s 
largest random variables. Since the random variables are i.i.d. Exp(l) variables, we know that this difference is 
exponentially distributed with parameter k (see, for example, Bertin [5]). 

Thus we can conclude that the two processes X (t) and Y (t) are basically the same. Furthermore, we know 
that Yfc (t), once rescaled, follows the generalized Gumbel distribution for large t. Specifically (see Clusel and 
Bertin [S] for details): 



Thus for Y (t) we have 



lim P (Yfc ft) - log N (t) G (a;,a: + dr))= „ 1 ,, e~ kx - e ' ' dx. 

t^ca [k — 1)! 



1 „_fc f _ e ~f dx __ 1/fc , ., r 



l^F(Y(t)-klo g N(t)e(x,x + dx)) = w -^e-^ *- = ^e— « dx, 
and therefore for the mean field particle X (t) we have that 

fim P (X(t) -I]ogJV(t) g (x,x + dx)\ = ^^e-^-^dx, 
so using the fact that log N (t) — (3ct — log /3c, we finally obtain 

lim F(X(t)-cte (x,x + dx))= ^— - e -( x+ ff-)- e y l 'dx. 

t-¥oo r (i/p) 

Since the distribution of X (t) — ct is centered, it follows that 

c =i e -*(?) 

8 

where \\j is the digamma function: 

T(x) 
Thus we have rediscovered the results of the previous subsection through extreme value theory arguments. 

6 Fluid limit 

In this section we prove Theorem 13.31 The section is organized as follows. In order to present and prove our 
results, we need to introduce notation, terminology, definitions and previously known results. This necessary 
background is introduced in Section 16.11 Then in Section 16.21 we formulate as separate theorems several in- 
termediate steps that lead to the main result and we prove Theorem 13.31 using these steps. In the following 
subsections we then provide proofs of the intermediate steps stated in Section 16.21 

6.1 Preliminaries 

Recall that our goal is to show that — given some assumptions — the path of the empirical measure process 
{/i„ (-)}„>i converges weakly in the Skorohod space D ([0,oo),Vi(M)) to /i(-), which solves the mean field 
equation (|3.9p . The goal of this subsection is to provide all the necessary preliminaries that we need during the 
proof. 
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We present important results from the theory of weak convergence that we use later; in particular, we focus 
on tightness results in Skorohod spaces. For an excellent introduction to weak convergence, we refer the reader 
to Billingsley [7|. Many results that we use are taken from Ethier and Kurtz [T2], and Jacod and Shiryaev [3T] 
is also a valuable source of information concerning convergence of processes. 

As in Theorem 13.31 from now on we assume that the jump rate function w is bounded, and we denote its 
lowest upper bound by a. To put it another way: a := lim^^-oo w (x) — sup,,, w (x). We also assume that w is 
differentiable and has a bounded derivative. Furthermore, in the following Z always denotes a random variable 
from the jump length distribution. As before, we assume that E (Z) = 1 and that Z has a finite third moment. 

6.1.1 Probability metrics 

Recall the definition (|3.10[) of the 1-Wasserstein metric. The following lemma (from Feng and Kurtz p~3l 
Appendix D]) summarizes what we need to know about the metric space (Vi (R) ,di). More can be found in 
e.g. Feng and Kurtz [T31 Appendix D]. 

Lemma 6.1. (Vi(M),di) is a complete and separable metric space. For /J, n ,[i E V\ (R), the following are 
equivalent: 

(1) lini^oo dx (//„, n) = 0. 

(2) For all continuous functions ip onM. satisfying \ip (x)\ < C (1 + |a;|) for some C > 0, 

lim (cp, /j, n ) = (ip, n) . 

n— >oo 

We introduce another probability metric that we use later on. For P, Q e V\ (R) and a set H of functions 
define 

(6.1) d n (P,Q) = sup \(f,P)-(f,Q}\. 

fen 

It is immediate that du is a pseudometric on V\ (R): du {P,P) = 0, du (P,Q) = du (Q,P), and the triangle 
inequality also holds. The only thing that does not necessarily hold is that du (P, Q) = implies P = Q. In 
order for this to hold, T-L has to be a large enough set of functions to be able to distinguish between different 
probability measures. There are various choices of T-L that give rise to notable metrics. For instance, if T-L is the 
set of indicator functions of measurable sets, then d-u is the total variation metric. If T-L is the set of half-line 
indicator functions then du is the Kolmogorov (uniform) metric. The choice of Ti = {/ € Cj, : |/| < 1} gives 
the Radon metric (that this is a metric follows from e.g. Billingsley [7J Theorem 1.2.]). Consequently the choice 
H = H (see (|2.ip ) also yields a metric. This is what we use in the following: dn denotes the metric on V\ (R) 
given by (|6.ip with W = H . This metric is used in Theorem 16.141 Corollary 16.151 and their proofs in Section 
16.51 For more on probability metrics we refer the reader to Gibbs and Su [16] for an excellent review. 

6.1.2 Skorohod spaces 

Let (E, r) be a general metric space. Denote by C ([0, oo), E) the space of continuous E- valued paths with the 
uniform topology. The Skorohod space D ([0, oo), E) consists of E- valued paths that are right-continuous and 
have left limits. We do not go into details about the topology on D ([0, oo), E), but we note that there exists a 
metric under which D ([0,oo),i?) is a complete and separable metric space if (E,r) is complete and separable 
(Ethier and Kurtz [T21 Chapter 3]). The topology generated by this metric is called the Skorohod topology on 
D ([0, oo), E). Note that C ([0, oo), E) is a subset of D ([0, oo), E) and it can be seen that the Skorohod topology 
relativized to C ([0, oo), E) coincides with the uniform topology there. For more on the Skorohod space we refer 
the reader to Billingsley [7[ Chapter 3] and Ethier and Kurtz [T21 Chapter 3]. 

The reason we introduced the Skorohod space in general is because the natural space in which to consider 
/!„ (•) is D ([0, oo), V\ (R)): we know that /i„ (•) is a piece-wise constant function in V\ (R), having jumps 
whenever a particle jumps. (We can define /i„ (•) to be right-continuous at its discontinuities.) Since V\ (R) is 
complete and separable under the Wasserstein metric, D ([0,oo),'Pi (R)) is also complete and separable under 
an appropriate metric. Similarly, for any test function / (for instance / G H), the natural space in which to 
consider (f,/J n (•)) is the Skorohod space D Q0,oo),R). 

6.1.3 Tightness in Skorohod spaces 

Recall our plan of action from the end of Section [5] 

• Prove tightness of {/j, n ( - )} n >i m ^ ne Skorohod space D ([OjOo),? 7 ! (R)). 
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• Identify the limit of {/j. n (•)} n >i : here we essentially have to show that the limit solves the deterministic 
mean field equation. 

• Prove uniqueness of the solution to the mean field equation with a given initial condition. 

In view of the first point, our goal here is to find checkable sufficient conditions for tightness in Skorohod spaces. 
Since all metric spaces we deal with are complete and separable, by Prohorov's theorem the concepts of tightness 
and relative compactness are equivalent in our setting, and therefore we use these two notions interchangeably. 
Following Perkins [25] , we say that a sequence is C-relatively compact if it is relatively compact and all weak 
limit points are a.s. continuous. 

Tightness in £>([0,oo),R) 

First, we state conditions for tightness in D ([0, oo), R). For y (•) 6 D ([0, oo), R) define 

(6-2) Jt(v(-))= sup \y(t)-y(t-)\, 

0<t<T 

the largest (absolute) jump in y (•) until T. Let {Y n (•)}„> i be a sequence of random variables in D ([0, oo), R). 
The following theorem, which is a combination of Billingsley Theorem 16.8.] and the corollary thereafter, 
characterizes tightness in D ([0, oo), R). 

Theorem 6.2. The sequence {Y n (•)}„>! is tight in D ([0, oo), R) if and only if the following three conditions 
hold: 

(i) 

(6.3) lim limsupP(|Y„ (0)| > L) = 0. 

L^OO 71—5.00 

(ii) For each T > 0, 

(6.4) lim lim sup P(J T (Y n (-)) >L) = 0. 

L^TCG 7i— >-oO 

(Hi) For each T > and e > 0, 

(6.5) lim limsupP inf max sup \Y n (t) — Y n (s)\ > s I = 0, 

<5^0 ji^oo y{ti} l<i<v a jte [t i _ 1) t i ) J 

where the infimum extends over all decompositions [tj_i, ti), 1 < i < v, of [0, T) such that ti — ij_i > S for 
1 < i < v. 

We now look at a condition which implies that any limit point is continuous, i.e. is in C ([0, oo), R). Following 
Ethier and Kurtz [L2J p. 147], for y (•) € D([0,oo),R) define 

(6.6) J (y (•)) - J e~ u min { J u (y (•)) , 1} du, 

o 

where J u (y(-)) is defined as in (J6.2I) . The following theorem characterizes continuity of a weak limit point 
(Ethier and Kurtz [T2] Theorem 3.10.2.]). 

Theorem 6.3. Suppose Y n (•) => Tl Y (•) in D ([0,oo),R). Then Y (•) is almost surely continuous if and only if 

J(Y n (-))^ n 0. 

We now give sufficient conditions for a sequence to be C-relatively compact. 
Theorem 6.4. Suppose the following two conditions hold. 

ft) 

(6.7) lim limsupP(|r„ (0)| > L) = 0. 

L ^OO 77 — 5-qq 

(ii) For each T > and e > 0, 

(6.8) lim lim sup P I sup \Y n (t) -Y n (s)\>e\ =0. 

*->0 „-yoo \ 0<t-s<8 

\0<s<t<T 
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Then {Y n (•)}._> i is C '-relatively compact in D ([0, oo),R). 

This theorem is stated in Grigorescu and Kang [THl Section 3] , but without a proof, and so for completeness 
we provide a short proof. 

Proof. Let us check tightness first: according to Theorem 16.21 we need to check that (|6.3p . (|6.4p . and (|6.5[) 
hold. Our first condition is exactly (J6.3D . From (J6.8D it follows that for every T > 0, Jt (Y n (•)) ^n 0, and 
consequently (|6.4p holds. To show that (J6.5I) follows from (|6.8[) . note that 



mf max sup |Y^ (i) - Y„ (s)\ < sup |Y„ (t) - Y n (s)\ , 

{*,} l<i<v st £i t ._ lt A 0<t-s<26 

0<s<t<T 

where the infimum extends over the same range as in Theorem 16.21 

Now let us check that any weak limit point is a.s. continuous. According to Theorem 16.31 we need to check 
that J (Y n (•)) =>„ 0. From (J6.8I) we know that for every T > 0, Jt (Y n (■)) =£•„ 0. For any fixed e > there 
exists a T such that e~ T < e/2. From (|6.6p it follows that J (Y n (•)) < TJt (Y n (•)) + e _T , and consequently: 



lim P(J(F„(-)) >£)< lim P(J T (F„(-)) > e/2T) = 0. D 

n— >oo n— foo 

Tightness in Skorohod spaces containing measure-valued paths 

Now let us turn to conditions that imply tightness in the Skorohod space of time-indexed measure-valued 
paths D ([0,oo),'Pi (R)). Our main tool is a slight modification of a theorem of Perkins [25j Theorem II. 4.1]. 
Perkins' theorem basically states that if we can check an extra condition, then C-relative compactness in 
D ([0, oo),Mf (E)) (where Mp (E) is the space of finite measures on the metric space E, with the topol- 
ogy of weak convergence) can be checked by checking the C-relative compactness of appropriate integrals in 
D([0,oo),R). 

Our modification of Perkins' theorem is the following. 

Theorem 6.5. A sequence of cadlag V\ (M)-valued processes {P n (•)} n >i * s C -relatively compact in 
D ([0, oo), Vi (R)) if the following conditions hold: 

(i) For every T > there exists a c — c (T) and a r = t (T) G (0,1) such that for every e > there exists a 
K = K T , e € K such that K < c/e 1 ~ T and 



supP sup P„ (t, (-oo, -K) U (K, oo)) > e < e. 

n \0<t<T / 

(«jj For every f € C&, {(/, P n (-))}„>i * s C -relatively compact in D ([0,oo),R). 

If one looks at the proof of Perkins' original theorem [3SJ Theorem II.4.1], then one can see that it holds 
true if one replaces Mp (E) with Mi (E) , the space of probability measures on E with the topology of weak 
convergence. However, our measure-valued process lives in a different space, namely /i„ (•) e D ([OjOo),? 3 ! (M)), 
where the Wasserstein space V\ (K) is endowed with the metric d\. It turns out that we cannot simply replace 
Mp (E) with V\ (K) in the theorem. Convergence in (Pi (M) ,di) is stronger than weak convergence: it is weak 
convergence and the convergence of the first absolute moment of the measure. It is this that causes the original 
proof of the theorem to break down. 

The proof of Theorem 16.51 is essentially the same as that of Perkins' theorem, but with a slight modification 
at one point, which requires that our condition (i) is stronger than in Perkins' original version. For completeness, 
we reproduce the whole proof, following the lines of Perkins [25] Section II. 4.]: this can be found in Section loTSl 

We use Theorem 16.51 in order to prove tightness of the empirical measure process {/i n (•)}«>! m Corollary 

EM ~ 

6.1.4 A few more facts that are used 

Before we move on to our results, let us finish the preliminaries with a few more things that we use later in the 
proofs (in particular in Section l6~4| . We rely on the continuous mapping theorem (Billingsley Section 2]). 
We do not need the theorem in full generality, only the following: 

Theorem 6.6. Suppose h is a continuous function from the metric space (S,d) to the metric space (S',d'). If 
X„ ^„ X in (S,d), then h(X n ) =>„ h(X) in (S',d'). 

Finally, we need the following two lemmas, which can be found as exercises in Ethier and Kurtz [121 Chapter 

3]. 
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Lemma 6.7. Let E and F be metric spaces, and let f : E —¥ F be continuous. Then the mapping x — > / o x 
from D ([0, oo), E) to D ([0 , oo) , F) is continuous. 

Lemma 6.8. The function 



/(*)(•) = J x(s)ds 



from D([0,oo),R) to D ([0, oo),R) is continuous. 

6.2 Main steps of the fluid limit procedure 

In this subsection we collect the intermediate steps that build up the proof of the main theorem, Theorem l3.3l 
leaving the proofs of these results to later subsections. We complete the proof of Theorem 13.31 at the end of 
the subsection. Recall our assumptions on the jump rate function w and the jump length Z which we assume 
throughout this section. 

Our first results deal with showing the tightness of {/i„ (-)} n >i m the Skorohod space D ([0,oo),Vi (R))- 

Theorem 6.9. Suppose Assumption 1 from Section^ holds. Then for any f £ C\, the sequence {(f,Hn ('))}n>i 
is C -relatively compact in D ([0, oo), R). 

We remark that although we only need the above result for continuous and bounded functions in the corollary 
below, continuity is not used in our proof, and with a slight modification in the proof the same can be proven 
for Lipschitz continuous functions. 

Corollary 6.10. Suppose Assumptions 1 and 2 from Section^ hold. Then {/i n (•)}„>! is C -relatively compact 
in D([0,oo),Vi 



Theorem 16.91 and Corollary 16.101 are proved in Section [ 

We remark that tightness can be proved in other ways as well, using the techniques of Ethier and Kurtz [T2l 
Chapter 3]. In particular, Thomas G. Kurtz communicated to us a different way of checking tightness, and we 
have included a sketch of his proof at the end of Section 16.31 

Our next group of results is concerned with identifying the limit of the sequence {/i n (■)} n >i" ^ n particular, 
our goal is to show that any weak limit point of {/j. n (•)} n >i solves the mean field equation. We prove this in 
two steps. Recall the definition (12.11) of H . 

Theorem 6.11. For every t > and every f £ H , 

(6.9) sup L4 S> / (M0)l -^ 

0<s<t n->oc 

P 

where > denotes convergence in probability. 

n— >oo 

Theorem 6.12. If /i„ (•) =>„ \i (•) in D ([OjOo),? 3 ! (R)), then for every t > and every f £ H , 

(6-10) Atjfa (•))=►„ A tJ ( M (-)) 

inR. 

An immediate corollary of these two theorems is the following. 

Corollary 6.13. Any weak limit point /z(-) of {/j, n (•)} n >i * n the Skorohod space D ([0, oo),? 3 ! (R)) solves the 
mean field equation almost surely. That is, if \x n (•) =>„ /! (•) in D ([0, oo), V\ (R)), then for every t > and 
every f £ H , A t j (/x (•)) = almost surely. 



The proofs of Theorem 16. Ill Theorem 16. 12[ and Corollary 16 . 131 can be found in Section [ 

Our final result concerns the mean field equation: the solution with a given initial condition is unique. 

Theorem 6.14. Suppose /i 1 (•) and /i 2 (•) are solutions to the mean field equation with initial conditions /ij and 
/io respectively. Denote by du (t) the distance dn ((J- 1 (t) ,/i 2 (£)) of the two measures at time t according to the 
metric defined by (|6.1[) with % = H . Then there exists a constant c such that 



(6.11) d H (t)<d H (0)e ct . 

Corollary 6.15. Suppose /i 1 (•) and /i 2 (•) are solutions to the mean field equation with the same initial condi- 
tion. Then /i 1 (•) = /i 2 (•), i.e. the solution to the mean field equation with a given initial condition is unique. 
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Let us now prove our main result. 



Proof of Theorem\3J& The sequence {fi n (•)}„>! is tight in D ([0, oo), V\ (R)) due to Corollary RTTU1 Corollary 
16. 131 shows that any limit point satisfies the mean field equation. Due to Assumption 3, any limit point satisfies 
the mean field equation with deterministic initial condition v. We know that the solution to the mean field 
equation with a given initial condition is unique ( Corollary 16. 15[) . Consequently, due to Prohorov's theorem, 
Mn (■) =^n I 1 (')i since we have proved tightness and the uniqueness of any limit point. □ 



6.3 Tightness 



In this subsection we prove Theorem 16.91 and Corollary 16.101 

Before starting the proof, let us introduce a particle system {ii (•)}*=! i related to {xi (-)}™ =1 , which we 
use in the proof. Initially, the two particle systems have the same configurations: x% (0) = Xi (0) for each 
i. Let {£i (-)} i= i be a particle system in which all the particles are independent from each other, all of the 
particles jump with rate a (recall that a = lim.r_j._oo w (x) = swp x w {%)), and whenever they jump, the jump 
length is a random variable from the same jump length distribution of the original system, independently of 
everything else. In other words, let Si (•) be independent Poisson processes with rate a for each i , let £J, ££, . . . 
be i.i.d. random variables from the jump length distribution, and define $i (t) — Xi (0) + 2Jfe=i £jt- Moreover, 
{xi (-)}" = i an d {ii (0}i=i can be constructed jointly on the same probability space in the following way. Define 
{ii (•)}{_! exactly as above, and whenever a particle Xi jumps, let Xi jump the same jump length with probability 
w (xi (t) — m„ (£)) /a; with the remaining probability let Xi stay in place. This joint construction leads to two 
advantageous monotonicity properties: first, 



for any i and t > 0, and second, 



3.12) 



Xi (t) > Xi (t) 



Xi (t) - Xi (s) > Xi (t) - Xi (s) 



for any i and t > s > 0. 

Proof of Theorem \6.(A Let / £ Cj, and denote by K the upper bound of |/|, i.e. \f (x)\ < K for all x. Define 
In (•) := (/, fin (•)> = 7[ __™=i f ( x i ('))• Clearly \I n (t)\ < K for any t. In order to prove tightness in D ([0, oo), R) 
we need to show that (|6.7|l and (|6.8p hold with Y n replaced with I n . Since / is bounded it is immediate that 
(16.71) holds. Now let us show that (|6.8[) holds. Our basic observation is that 



|I„ (*)-/»(*) | 



£(/(*«(*))-/(*,(*))) 



1 " 2K - 

< - E i/ (*< (*)) - / o* («))i ^ — E ! [*< (*) > x * ( s )i 



By monotonicity of Xi (•) and using (|6.12[) we have 



sup \I n (t) - I n (s)\ > e | < 

0<t-s<6 
\0<s<t<T 



< 



< 



Sup V" 1 [iBi (t) > _* (s)] > £ 

o< t - ;5 <5 n f— ' 

\0<s<t<T l - L 

f IK - N 

sup > 1 [xi (s + 25) > „, (s)] > e 

yO<s<T n fl J 

' 2K n N 

sup y^ 1 [xi (s + 25) > x t (a)] > e 

y o< s <T n ^ 



To abbreviate notation, define Y n j (s) := =£■ YH=i 1 [%i ( s + $) > ^« ( s )]- Now if sup 0<s<T Y n .2& (s) > e, then 
there exists an integer k, < k < \T/S\ — 1, such that Y n: $ (kS) > e/4. Suppose the contrary, that for every k 
the above expression is less than e/4; then since an interval of length 26 can intersect at most three neighboring 
intervals of length 6, it follows that sup 0<s<T Y n ^s (s) < 3e/4. Therefore 



sup Y n<2S (s) > e J < P (3fc, < fc < [T/5] - 1 : F„. 5 (M) > e/4) 

,0<s<T / 

= 1 - P (Vfc, < k < \T/5] - 1 : Y niS (kS) < e/4) 

= l-(P(r„^(0)<e/4)) rT/51 

= l-(l-P(y„,,(0)>e/4)) rT/51 , 
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where we used that for every i, ft (•) has independent and stationary increments. We bound the probability in 
the last line using Markov's inequality 

p rjr i [^ (s) > Xl (o)] > H j < e l( J2 1 [^ (s) > xi (o)] j J /^, 

where 

E ( ^ 1 [ft (<5) > ii (0)] j J = nP (ft (5) > ft (0)) + (n 2 - n) P (ft (S) > ft (0) and ft (5) > ft (0)) 

= n (1 - e- aS ) + (n 2 - n) (l - e~ a& f 
« na<5 + (n — n) (ai5) , 

from which (|6.8p follows immediately. □ 



Proof of Corollary 1 6'. i 01 In order to prove this corollary we need to check the two conditions of Theorem [ 
The second condition follows immediately from Theorem 16.91 It remains to show the first condition of Theorem 
16.51 Due to monotonicity of the Xi (£) in t, we have 

sup fln{t, (-00, —K)D (K, oo)) < sup n n (t, (-co, -K)) + sup n n (i, (if, oo)) 

0<i<T 0<t<T 0<t<T 

= /i„ (0, (-oo, -if)) + y, n (T, (if, oo)) , 
and consequently 

supP I sup fx n (i, (— oo, — if) U (if, oo)) > e 

n \0<t<T 

< supP {n n (0, (-co, -if)) + nn (T, (if, oo)) > e) 

n 

< supP (m„ (0, (-co, -if)) > e/2) + supP ( M „ (T, (if, oo)) > e/2) . 

n n 

Let us look at the second term: 

1 - 
H n (T,(K,oo)) = -J2^-[xi(T) > K] 

i=l 
1 n 1 n 

< - V 1 [xi (0) > K/2] + - V 1 [xi (T) - x % (0) > K/2] 

i— 1 z— 1 

^ n 1 n 

< - £ 1 [a* (0) > K/2] + - J2 1 fo ( r ) " £* (0) > #/2] , 



n ■* — ' n ■ 

i=l i=l 



and so consequently 

supP(^„(T, (if,oo)) > e/2) < supPOu„(0,(if/2,oo)) > e/4) 



sup P j i £ 1 [ft (T) - x, (0) > if/2] > e/4 j 



Let us look at the second term on the right hand side. Using Markov's inequality and the fact that the ft's are 
identically distributed, we have 

(1 n \ A 

- 53 1 [ft (T) - Xl (0) > if/2] > e/4 < -P (ft (T) - ft (0) > if/2) . 

Now using Markov's inequality again, we get that 

^P (ft (T) - ft (0) > K/2) = Jp ((ft (T) - ft (0)) 3 > if 3 /s) < -J^E ((ft (T) - ft (0)) 3 ) =: ^. 
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Now putting everything together we get 

supP( sup fj, n (t,(-(x,-K)U (K,oo)) >e J < sup P (/i„ (0, (-00, -iT)) > e/2) 

n \0<t<T / n 

c(T) 
+ su P P(/i„ (0, (K/2,oo)) > e/4) + -^, 

and we need the left hand side to be at most e, for which it is enough that the right hand side is at most e. 
So let T > and e > be fixed, and let us choose c= c (T), r = r (T), and if = Kt, e as follows: 

c(T) :=max{3c(T),8c 3 } 
r(T):= 1/3 

K TiE : =c (T) 1/3 e- 2 / 3 , 

where c is given by Assumption 2. Then K 3 > 3c(T)e~ 2 , and so c(T) / (K 3 e) < e/3. On the other hand 
K > 2ce -2 / 3 , and so using Assumption 2 we have 

sup P 0„ (0, (-oo, -X)) > e/2) < e/3 

n 

supP {n n (0, (if/2, oo)) > e/4) < e/3. 

n 

Putting all these together concludes the proof. □ 

As mentioned before, there are other ways to check tightness, and here we sketch Thomas G. Kurtz's proof 
(see also Kotelenez and Kurtz [22l Appendix]). First note that 

di{n,v)<-E\X-Y\, 

where X and Y are any random variables with distributions [i and v respectively. The paths of our particles 
{xi (•)}£=! are exchangeable, and so for any < F^ n( '' >-stopping time r we have 

(6.13) E (di (m„ (t) , fi n (r + u))) < E (^ (r + u) - a?i (r)) . 

For each T > 0, let S n (T) be the collection of < J 7 ^ >-stopping times r satisfying t <T. Then by (|6.13p and 

Aldous's criterion (see e.g. Ethier and Kurtz P2j Theorem 8.6]), {/i„ (■)}«> i i s tight in I? ([OjOo),? 3 ! (R)) if for 
each T > 0, 

lim limsup sup E (xi (t + u) — x\ (r)) = 0, 

and also condition (a) of [121 Theorem 7.2] holds. In our case, for a bounded jump rate function w, these 
conditions can be checked just like before, in the previous way of showing tightness. 

6.4 Identifying the limit 

In this section we prove Theorem 16.111 Theorem 16.121 and Corollary 16.131 

Proof of Theorem \6.11l Let t > and / € H be fixed from now on. As in the previous section, define I n (•) := 
(/, [i n (•)) = \ YTi=\ f ( x i ('))■ Calculation shows that 

LI n (t) = ((E (/ (x + Z))-f (x))w (x - m„ (t)) , ^ (t)) , 

which means that M n (t) := A t j (n n (■)) is a martingale in t with M n (0) = 0. Therefore by Doob's inequality 
(see e.g. Jacod and Shiryaev [3TJ p. 11]) we have that for any c > 0, 

sup M "i is) I ., 

x o< s < t " V V ^ 4E M 2 f 
sup M„ (a) > c < 5 < 

0<s<t / c 

So in order to show (|6.9p we need to show that 

(6.14) lim E(Af 2 (i)) =0. 
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We know that 

t 

N n {t):=M 2 n {t)- J LMl{s)ds 

o 

is a martingale with N n (0) = 0, and therefore 

(6.15, .(*(„)-.(/**;(.,*)-/«(**;(.,)*. 

\o / o 

The order of the expectation and the integral can be switched due to the Fubini-Tonelli theorem, because 
LM% (s) > as we will see shortly. Let us calculate LM% (£): 

LM I (t) = Um 1e (M 2 (t + *) - M I {t)\T t ) 



ton J^E ((M„ (i + eft) - M„ (i)) 2 | Ji) 



where we used that M n (t) is a martingale. Since LI n (s) is bounded, we have 

t+dt 

M n {t + dt) - M n (t) = (/„ (t + eft) - /„ (*)) - J LI n (s) ds 

) 
= (I n (t + dt)-I n {t)) + 0(dt) 

and so consequently 

\2 



LMl (t) = lim iE f(M n (t + rft) - M„ (i)f 

= lim 1e ((/„ (t + dt) - /„ (t)) 2 Ft) 

dt-K) dt \ ) 



^ 



? ^ w (^ (t) - m„ (t)) E ((/ (x, it) + Z)-f ( Xi (*)))' 



n 

Now if / e C&, |/| < 1, then LM 2 (t) < 4o/n. If / = Id, then LM 2 (t) < a£ (Z 2 ) /n. In both cases this shows, 
via ([oTT5|) . that (pH|) holds. □ 

Proof of Theorem \6.1 6 A Recall that w < a and that \w'\ < a' . 
We first show that for every t > and every / <G H , 

(6.16) (/,M*))=M/> **(*)) 

in R. First of all, we know that the projection 7r t : D ([0, oo), Pi (R)) — > Pi (R), given by 7Tt (f (•)) = ^ (t), is 
continuous at ^ (•) if and only if v (•) is continuous at t. Since \x (■) € C ([0, oo), Pi (R)) a.s. (by Corollary |6.10l) . 
the continuous mapping theorem and the fact that /i n (•) =4>„ /i (•) in D ([0, oo), Pi (R)) together imply that for 
every t > 0, 

Mn (*) =>n M (*) 

in Pi (R). Now for any / £ H, the function hf : Pi (R) — >• R, given by hf (v) — (f,v), is continuous. (This 
follows easily from the equivalence of (1) and (2) in Lemma 16. II ) Using the continuous mapping theorem, this 
shows (J6.16D . 

We introduce <?/ (x) := (E (/ (x + Z)) — f (x)) in order to abbreviate notation. What remains to be shown 
is that for every t > and every f £ H . 

t t 

(6.17) / (g f (x) w(x- m„ (s)) , fj, n (s)) ds => n / (g f (x) w (x - m (s)) , \i ($)) ds 
o o 

in R. First, let us show that for any f £ H, the function hf : V\ (R) — > R, given by 
hf {y) = {gf (x) w (x — M) ,v), is continuous, where M = J xv (dx). So suppose lim„_ i . 00 d\ (i/ n , v) = 0, and 
denote by M n the mean of v n , i.e. M n = J xv n (dx). We need to show that in this case 

lim \{g f (x) w(x- M n ) , v n ) - (g f (x) w (x - M) ,v)\ =0. 

n—>-oo 
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We use the triangle inequality to arrive at 

| (g f (x) w(x-M n ), v n ) - (g f (x) w (x - M) , v) \ < \ (g f (x) [w (x - M n ) - w (x - M)} , v n ) \ 

+ I (.9/ (%) w(x- M) , v n ) - {g f (x) w(x-M),v)\. 

The first term on the right hand side can be bounded above by 2a' \M n — M |. This term goes to zero as 
n — > oo, due to lim n _ i . 00 d\ (y n , v) = and the equivalence of (1) and (2) in Lemma 16.11 For every / G H, 
gf (x) w (x — M) is a bounded and continuous function, and consequently the second term goes to zero as n — > oo 
as well, due to linin^oo d\ (v n , v) = and the equivalence of (1) and (2) in Lemma |6. II Thus hf is continuous 
for every f £ H. 

Consequently the continuous mapping theorem and Lemma 16.71 together imply that 

(9f (x) V)(X- m„ (•)) , Mn (')) =*-n (9f (x) W (x - TO (•)) , (1 (•)) 

in D ([0, co),R). Lemma T6. 81 then implies that 

(g f (x) w(x— m n (s)) , fin (s)) ds ^> n / (gf (x) w (x - m (s)) , fi (s)) ds 



in D ([0,oo), R). The integral function on the right hand side is continuous, which implies that the one- 
dimensional distributions converge weakly in R, i.e. (|6.17p holds. □ 

Proof of Corollarv \6.13\ Fix t > and / £ H . Theorem 16.111 implies that 

A tJ (P-n (')) =>« 

in R. On the other hand, Theorem 16.121 implies that 

AtJ (fl n (•)) =*>« A t>f (fi (•)) 

in R. However, we know that a sequence cannot converge weakly to two different limits (Billingsley Section 
2]), and hence A t j (fi (•)) = almost surely. □ 

6.5 Uniqueness of the solution to the mean field equation 

In this subsection we prove Theorem 16. 141 and Corollarv l6.15l 

Proof of Theorem \6.14\ We have to show that there exists a constant c such that (16. lip holds. In order to do 
this we prove that there exists a constant c such that 



(6.18) d H (t) < d H (0) + c d H (s)ds, 



from which (|6.11[) follows using Gronwall's lemma. 

So let us now prove (|6.18|) . Denote by to 1 (t) and m 2 (i) the mean of fi 1 (t) and fi 2 (t) respectively: m l (t) = 
J xfj, 1 (t,dx) for i = 1,2. We know that /1 1 (•) and fi 2 (•) are solutions to the mean field equation, with initial 
conditions /ij and fi^ respectively, and so 

Atj {fi' (•)) = 

holds for all / G H, for i = 1, 2 (recall (|3.8|> ). Using this, the definition of the metric dH, and simple inequalities, 
we have 

dH(t)-sup|(/, M 1 (t))-(/, Ai 2 (i))| 
feH 

< sup|(/,/4)-(/,^o)| 
feH 



+ / sup\{(E(f(x + Z))-f(x))w(x~m 1 ( S )),fi 1 ( S )) 
J feH 


- ((E (/ (x + Z)) - f (x)) w(x-m 2 (a)) , fi 2 (*)> | ds. 
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The first term on the right hand side is exactly dn (0), and so what we have to show is that the suprcmum 
inside the integral is at most a constant multiple of dn (s). In order to abbreviate notation, we introduce 

g f (x):=(E(f(x + Z))-f(x)). 

We know that gid = E (Z) = 1 and that for every / 6 C&, |/| < 1, gj is continuous and \g/\ < 2. First we use 
the triangle inequality: 

SU P \(9f(x)w(x-m 1 (s)) ,^ ($)) - (g f (x)w(x-m 2 (s)) , y 2 (s))\ 

< sup \(g f (x)w(x-m 1 (s)) , y 1 (s)) - (g f (x) w (x - m 1 (s)) ,/J 2 (s))| 
feH 

+ sup I (.9/ (x) w(x~m 1 (s)) , /i 2 (s)) - (5/ (x) w (x - m 2 (s)) , ^ 2 (s)) I . 

We bound the two terms separately by a constant multiple of dn (s). Let us start with the first term. If / £ Cb, 
\f\ < L then gf (x)w (x — m 1 (s)) is continuous in x and its absolute value is at most 2a, so consequently 
the expression in the supremum is at most 2adn (s) ■ If / = Id, then the expression in the supremum is 
(tu(i- to 1 (s)) , /i 2 (s)) — (u> (x — m 1 (s)j ,/z 1 (s)) which is at most adn (s) since w (x — m 1 (s)j is continuous 
in x and |w;| < a. Thus the first term is at most 2adn (s). 

Now let us look at the second term. Using the fact that w' is bounded we have 



«' 



(x — m (s)) — w (x — m (s)) < a' \m (s) — m (s) < a! dn (s) i 



and since \gf\ < 2 for all / £ H , it follows that the second term is at most 2a' dn (s). Therefore this shows 
(jeTTB)) with c := 2a + 2a'. U 

Proof of Corollarv \6.15\ If dn (0) = then, due to Theorem 16. 14[ dn (t) — for all t > 0, and consequently 
y 1 (t) = y 2 (t) for all t > 0, so \i x (•) = /x 2 (•). D 

6.6 Proof of Theorem 16.51 

Before proving Theorem 16.51 let us present a result for general Polish space state spaces that we use during 
the proof. Suppose E is a Polish space and d is a complete metric on E. For x (•) € D ([0, 00), E), T > 0, and 
<5 > 0, let 

w (x (■) ,5,T) := sup d(x (t) ,a;(s)) . 

0<s,i<T,|s-i|<,5 

(We note that this w should not be confused with our jump rate function w — the distinction will be obvious 
from the context.) Then: 

Theorem 6.16. Suppose {X n (•)}„>! * s a sequence of cadlag E -valued processes, where E is Polish. {X n (•)}„.>! 
is C -relatively compact in D ([0, 00), E) if and only if the following two conditions hold: 

(a) For every T > and every £ > there exists a compact set K° = K^ £ C £ such that 

supP (X n (t) £ K° for some t <T) < e. 

n 

(b) For every T > and every e > there exists a 5 > such that 

lim sup P O (X n (■),S,T)>e)<e. 

n— >oo 

This theorem follows from Corollary 3.7.4, Remark 3.7.3, and Theorem 3.10.2 of Ethier and Kurtz |12) . 
Note: (a) is called the compact containment condition. 

Proof of Theorem \6.5\ To prove that {P„ (-)} n >i m C-relatively compact in D {\fi,oc),Vi (K)) we need to show 
that conditions (a) and (b) of Theorem 16 . 1 61 follow from conditions (i) and (ii) of Theorem l6.5l Notice that it is 
enough to demonstrate this for small e's. First, let us start with the compact containment condition. Let T > 
and e S (0, |) be fixed. To this T > 0, there exists a c = c (T) and a r = r (T) £ (0, 1) given by condition (i). 
For every m > 1, let e m :— ce/m 2 / T , where cr 1 — J2m>i m ~ 2 ^ T so that Yl m >i £m = £ - N° w by condition (i), 
to the fixed T and e m there exists a K m :— Kx.e m £ K such that 

m — _1 — T /- \1— T 

£ m (ce) 
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and 

supP ( sup P n (t, (-00, -K m ) U (K m , 00)) > e m I < s m . 

n \0<t<T J 

This implies that 

supP ( sup P n (t, (-00, -K m ) U (K m , 00)) > l/rn 2/r ] < £ m , 

n \0<t<T / 

since e m < l/m 2 / r . Now given the -K" m 's, we define the following set of probability measures: 
C° := |/i € Pi (M) : /1 ((-00, -Jf m ) U (iT m , 00)) < l/m 2/r for every m > l} 
(Of course C° = Cy. £ .) The choice of the if TO 's implies that 

sup P (P n (t) £ C° for some t < T) < e, 



since 



(P„ (t) ^ C° for some t < T) = P (3m > 1, 3i > T : P„ (t, (-00, -if ro ) U (X™, 00)) > 1/to 2/t ) 



= P I 3m > 1 : sup P n (t, (-00, -# m ) U (X m , 00)) > 1/to 2/t 

0<t<T 



<VP sup P„ (t, (-00, -lf m ) U (X m , to)) > 1/to 2/t 

00 



m— 1 

Consequently, 



supP (P n (t) i K° for some t<T) < e, 

n 

for if := C°. This if will be good as the compact set needed in condition (a) — what is left to show is that 
K° is compact in V\ (P). For this, it is enough to show that C° is relatively compact in V\ (P.). We know (see, 
for instance, Feng and Kurtz [13] Appendix D]) that C° C V\ (P) is relatively compact if and only if 



(6.19) lim sup / \x\ix{dx) = 0. 

N ^°° nec° J 

\x\>N 

So let us now check (|6.19p . Note that this is where the original form of Perkins' theorem [351 Theorem II. 4.1] 
is not enough and we have to use our modified version, Theorem 16.51 If N > K m , then 



/oo „ 

I re I /1 (dx) < sup 2, I 

\x\>N L - m [-K l + 1 ,-K l )u(K l ,K l + 1 ] 



\x\ /j, (dx) 



< sup S2Ki +1 h((-oo,-Ki)U(Ki,oo)) 

^ C ° l=m 

< V - (i + i) 2/T - 2 — 

-f^(ce) 1 ~ A ' P' T ' 

l—m * ' 

This last expression goes to as m — > 00, which shows (J6.19I) . and consequently we have shown that condition 
(i) implies condition (a). 

Now let us show that conditions (i) and (ii) imply condition (b). First we show that for every /eC(, (Pi (P)), 
{/ ° Pi (')}n>i ^ s C-relatively compact in D ([0, 00), K). By the characterization of C-relatively compactness 
we need to check conditions (a) and (b) of Theorem 16. 161 Due to the boundedness of /, (a) is immediate. Now 
let us show (b). Fix / e C b (Pi (P)), T > 0, and e > 0. Choose K° = K^ e as in (a). Define 

k m,i 

A = {h : Vi (P) ->■ R \h {(i) = J2 a H (fwP) > a i e R ' hi e c h, k > m * e z +} C C b (Pi (P)) . 
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Then A is an algebra containing the constant functions and separating points in V\ (R). So by the Stone- 
Weierstrass theorem (see for instance Willard |32j ) there exists an/igi such that 

sup \h{n)-f(ji)\ <e. 

If {Y n (■)}„>! an d {%n (')},i>i are C-relatively compact in D ([0, oo), R) then so are {aY n (•) + bZ n (•)}„>! an d 
{Y n (•) Z n (•)}„> i for any a, b € R (see the characterization of Theorem 16.161) . Therefore condition (ii) of the 
theorem implies that {h o P n (-)} n >i is C-relatively compact in D ([0, oo), R). Now using the characterization 
of Theorem 16.161 again, there exists a S > such that 

limsupP(u;(/ioP n (.), ( 5,T) > e) < e. 

n— »-oo 

To abbreviate notation, we introduce P„ ([0,T]) := {P„ (*):*€ [0,T)}. If s,£ < T, then 

|/(P n (t))-/(P n ( 5 ))| = |/(P„(t))-/(P n ( S ))|l[P„([0,T])Ciif ] 
+ |/(Pn(*))-/(Pn( S ))|l[P n ([0,T])^ir°] 
<2 sup |/i(/i)-/Gu)| + |MP n (f))-MP n (s))| 

+ 2||/|| oo l[P„([0,T])^/f°] 

and so by the definition of w (/ o P„ (•) , S, T) we have 

w (f o P„ (•) , S, T) < 2e + w (h o P„ (•) , 6, T) + 2 H/]^ 1 [P„ ([0, T]) £ ff°] . 

Therefore 

limsupP(u;(/oP„(-),(5,r)>3e) < limsupP (P„ ([0, T]) £ K°) + limsupP (w (h o P„ (•) , 5, T) > e) 

n— >-oo n— foo n— >oo 

< e + £ = 2e < 3e, 

by the choice of if and the fact that {h o P„ (-)} n >i is C-relatively compact in D ([0, oo), R). 

So let us recap what we have done up until now: we have shown that (i) implies the compact containment 
condition and that (i) and (ii) imply that for every /£ Cj (Pi (R)), {/ o P n (•)}„>! is C-relatively compact in 
D ([0, oo), R). These imply that {P n (-)} n >i * s relatively compact in D ([0, oo), R) (see Theorem 3.9.1 of Ethicr 
and Kurtz [T2]). What remains is to show that it is not only relatively compact, but also C-relatively compact. 

We now show that (i) and (ii) imply (b). Let us define the metric d := min{di, 1} and let 

w d (P„ (•) , S, T) := sup d (P„ (t) , P„ (sj) . 

0<sJ<T,\s-t\<S 



Let T > and e > be fixed, and choose K° as in (a) again. Since K° is compact, we can choose yn £ K°, 
i=l,,.,,M, so that K° C uf^B (m,e), where 

B (n, e) = {v e Pi (R) : di (^, ^) < e} • 

Since £ < 1, 

B (ji, e) = {v£ Vi (R) : d (/x, v) < e} . 

Now let /i(/i) := d(m,n). Clearly /« e C 6 (Pi(R)). We have proved that for every / G C b (Pi 
{/ ° Pi (")}n>i * s C-relatively compact in D ([0, oo),R), which implies that there exists a 6 > such that 

M 

(6.20) ^ limsupP (w {fi o P„ (•) ,6,T) >e)<e. 

. - n— >oo 
%— 1 

If ^/, ^ G X°, choose /^j so that d (z/, ^) < e. Then 

d(jt,i/) < d(/i,Mj) +d0*j,i/) < |d(Afr,^-)-d()U.j,i/)| -\-2d(ji j ,v) < max|/i (m) - /« (i/)| + 2e. 

Now let s,t < T: the above inequality implies 

d (P„ (t) , P n (*)) = d (P„ (t) , P„ (*)) 1 [P„ ([0, T]) C K°] + d (P„ (t) , P„ (s)) 1 [P„ ([0, T]) ^ A- ] 
< max |/ 2 o P„ (t) - f t o P n ( s )| + 2e + 1 [P n ([0, T]) ^ K°] , 
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where we used that d < 1. Consequently 

w d (P„ (•) ,S,T)< maxw (/,; o P n (•) , <5, T) + 2e + 1 [P„ ([0, T\) £ K°] . 

Now note that {w (P n (•) , S, T) > 3s} and {wd (P n (■) , 5, T) > 3e} are exactly the same events, and consequently 

limsupP (w(P n {-),S,T) > 3e) = lim sup P (w d (P n (•) , 5, T) > 3e) 

n— >oo n— >-oo 

< limsupP(maxw(/ i oF„(-),,5,T) > e) + limsupP (P n ([0,T]) £ K°) 



n—>oo \ i 

M 

< limsu P ^P( W {fi o P n (-),S,T) >e) + limsupP (P„ ([0,T]) £ if ) 

z— 1 

< e + e = 2e < 3e, 

using (|6.20p and the choice of K°. This shows condition (b), and therefore completes the proof. □ 

7 Simulation Results 

As mentioned before, we believe that the empirical measure of the particle system for a large number of particles 
is well approximated by the mean field equation for a wide class of jump rate functions w. In this section we 
support this conjecture via simulation results in cases when Theorem l3.3l does not apply due to the assumptions 
made on the jump rate function w. Wc test the simulated particle distribution around the mean position against 
the mean field stationary formulas calculated in Examples 15.11 and 15.21 We see that in both cases the particle 
system moves as a traveling wave with speed very close to the one predicted by the mean field equation, and 
the empirical density of the particle system fluctuates around the stationary density (J3.5D . 

Suppose the stationary density derived from the mean field approximation is p. We fix the number of 
particles (n — 10 4 in the examples) and run the simulation for the finite number of particles for time t e [0, T]. 
At t = we suppose that every particle is at 0. For every time t there exists an empirical density histogram 
p n (;,t) of the particles around their empirical mean position, and from this we derive the time average for the 
empirical density histograms 

T 
Pn,T (x) = — / pn (x, t) dt. 



Since after a certain relaxation time to the distribution of the particles around their mean position is very close 
to the stationary distribution, if we choose T big enough compared to to then we do not need to know the 
exact value of to, and p n will be very close to the stationary distribution of the n particles around their mean 
position. In both examples considered, we show plots of the time average p n .T, and we also give a snapshot 
of the empirical density histogram p n (-,t) at a given time t to give the reader an idea of how this fluctuates 
around the stationary density. 

One question remains: how do we create the empirical density histogram? We choose an interval / = [ao, &i] 
so that Jj p (x) dx p» 1 and divide / into N = ai ^ a ° intervals of length h n . In our cases the tails of the 
distribution are always at least exponential, so if I is chosen big enough then we do not have to deal with what 
happens outside /. From this the empirical density histogram of the particles around their mean position at 
time t is 

p n (x,t) = —— \{i : Xi (t) -m n (t) e Aj,i = l,...,n}\ , if a; € A 3 -,j = 1, ...,n, 
nh n 

where Aj = [ag + (j — 1) h n , ao +jh n ). We know that if X\ if) , . . . , x n (t) are independent identically distributed 
random variables, then if lim„_ i . 00 h n = and limn^oo nh n = oo then the empirical density histogram converges 
at every continuity point of the density as n — » oo. However, in our case the Xi (t) random variables are not 
independent. Nonetheless, we choose N = 2y/n and therefore h n = a \~?- a so that the criteria above is satisfied. 

7.1 Jump rate function is exponential 

Let w (x) = e~@ x as in Example 15.11 in this case the stationary mean field density is given by (|3.7[) . Figure 
2] shows a plot of pp in blue and the time average of the empirical density histograms p n ,T,p in red with the 
following parameters: ft = 1, n = 10 4 , ao = —10, a\ = 10,T = 1000. We conclude that the two fit extremely well 
and note that simulations show the same for different values of the parameter /3. Figure O shows the empirical 
density histogram p n $ (-,t) at time t = 500 with the other parameters the same as in Figure [H Finally, Figure 
[6] demonstrates that the particle system moves at a constant velocity c predicted by the mean field model. 
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n=10 4 , 0=1, T= 1000 
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Figure 4: Time average of empirical density histograms (red curve) in the case when w is exponential. Mean 
held stationary distribution is shown in blue. 



n=10 4 , j8=l, t=500 




Figure 5: Snapshot of empirical density histogram at time t = 500 (red curve) in the case when w is exponential. 
Mean held stationary distribution is shown in blue. 
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Mean of travelling wave 
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Figure 6: Mean position of the particles in the case when w is exponential. Blue line is from the mean field 
result, red line is from simulation with n = 10 4 particles. 
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Figure 7: Time average of empirical density histograms (red curve) in the case when w is a step function. Mean 
field stationary distribution is shown in blue. 
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n=10 4 , a=2, b=l, t=500 
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Figure 8: Snapshot of empirical density histogram at time t = 500 (red curve) in the case when w is a step 
function. Mean field stationary distribution is shown in blue. 
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Figure 9: Mean position of the particles in the case when w is a step function. Blue line is from the mean field 
result, red line is from simulation with n = 10 particles. 
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7.2 Jump rate function is a step function 

Let us consider the jump rate function analyzed in Example 15.21 when w is a step function. In this case the 
stationary mean field density is given by (I5.7[) . Figure [7] shows a plot of p a .b in blue and the time average of the 
empirical density histograms pn.T.a.b in red with the following parameters: a = 2, b = 1, n = 10 4 , a^ = —10, a\ = 
10, T = 1000. Once again we conclude that the two fit extremely well and note that simulations show the same 
for different values of the parameters a and b. Figure [5] shows the empirical density histogram p n .a.b {•> t) at time 
t = 500 with the other parameters the same as in Figure [7] Finally, Figure [5] demonstrates that the particle 
system moves at a constant velocity c predicted by the mean field model. 
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